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ABSTRACT 


A FORTRAN IV computer program was employed to conduct a 
statistical analysis of data collected during fleet anti- 
submarine warfare exercises. The object of the investigation 
was the identification of those variables which had greatest 
influence ona destroyer's ability to detect a submarine 
tmaer certain conditions. 

The variables were treated as a random vector arising 
from one of two multivariate normal populations with common 
covariance matrix. An artificial regression relation was 
formulated to facilitate development of a linear discriminant 
function in a subset of those variables found to be of dominant 
importance. This latter subset was identified by examination 
of multiple correlation coefficients. 

The discriminant function was found to be seventy five 
per cent effective in classifying the experimental data 


camrectly . 
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I. INTRODUCTION 


The sonar-equipped-.destroyer is the primary surface 
vessel employed by the U.S. Navy in antisubmarine warfare. 
Detection of the submarine necessarily precedes any offen- 
Sive action the destroyer can take against it. It was the 
Sogective Of the author to identify those variables which 
had the greatest influence on the effectiveness of a 
destroyer in detecting a submerged submarine during fleet 
exercises. Consideration was limited to detection capabil- 
ities while using sonar equipment in the active mode. 

Reference 2 lists many of the large number of variables 
which influence the effectiveness of a destroyer in submarine 
detection. Certain of these variables can be regarded as 
control variables in the sense that the value of those vari- 
ables can be chosen and fixed by the destroyer commander with 
a certain degree of freedom. For example, the destroyer's 
Heeeammay Lange from zero to thirty knots. Furthermore, a 
particular destroyer can be equipped with one of Several types 
of sonar system, and can be manned with differing numbers of 
POomemcecnnicians Of Various Skill levels. These latter 
ee serve as examples of control variables whose value 
can be adjusted by agencies exogenous to the ship. Similar 
comments can be made concerning the submarine involved in an 


encounter with a destroyer. No attempt was made by the author 






to consider any variable as a control variable; all were 
regarded as completely random variables. 

Based on personal experience in the Destroyer Force, the 
author chose a set of twenty-two variables whose examination 
was expected to be AOS fruitful for the purpose stated. 

Four groups of variables were considered: environmental 
factors, destroyer characteristics, submarine characteristics, 
and tactical factors. “Table I lists the variables included 
in each group. Qualitative variables, such as destroyer hull 
type were asSigned numerical codes. All values were recorded 
at the time detection was achieved, or - in those cases where 
detection did not occur - when the submarine had approached 
to a specified range. This range was specified by the U.S. 
Navy Antisubmarine Warfare Data Center (NADAC) as a standard 


for each sonar system. The standards are listed in Ref. 2. 


Table I. Variables) by Group 


—_—_—S a 


Enwalrconmental PESEROver Submarine aed e ok 


Geographic Hull type Hull type Relative speed 

Pocation between 
OpOenemeEs 

Present weather Hull number Hull number Distance between 
OPpponenes 

Sonic layer Sonar type Depth Aspect angle of 

depth submarine 

Wind speed Speed Speed Relative bearing 


from ce Siei@ eG 
to submarine 


Sea state Self noise 
Surface turbu- Technician com- 
lence plement 

Bathy thermo- 

graph profile 

Ey De 


Water depth 








The data examined was supplied by NADAC on request, having 
been collected during sixteen fleet exercises conducted in 
the Atlantic and Pacific Oceans during the period 1962 to 
ioo,etnelusive. The destroyer types considered included DLG, 
DL, DDG, DDR, DD, DEG and DE. Each destroyer was equipped 
with one of the following AN/SQS sonar systems: 23, 26, 29, 
or 30. Submarine types encountered included all types in the 
eoewec leet during this period, ranging from the World War II 
vintage fleet submarine to the nuclear-powered ballistic 
missile submarine. 

Each observation of the twenty-t:wo variables was regarded 
Pomoncandem vector, X, arising from one of two multivariate 
normal populations with common covariance matrix: N(m,, V), 

Ox N(m.,, V). The former distribution representing an encounter 
which resulted in detection of the submarine by the destroyer, 
and the latter distribution applying otherwise. 

Maximun likelihood estimators were used for the mean 
vectors, covariance matrix, and all other parameters employed 
in the analysis. 

a FORTRAN IV computer program was prepared for analysis 
of the data and was utilized with the U.S. Naval Postgraduate 
School IBM-360 computer. A discussion of the program, a flow 
chart, and the program appear below. 

In all, observations from ninety two encounters were 
included in the analysis. Twenty-seven encounters resulted in 


Submarine detection by the destroyer, and sixty-five did not. 





II. DISCUSSION 


peeeee RE NCIPAL STEPS OF THE ANALYSIS 

Enestollowing discussion 1s conc2rned with the principal 
concepts employed in the analysis, without regard to the 
computer programming techniques employed. A full discussion 
of the program appears later. 

moes linear discriminant function 


enlee v- (m 


—- J i a 


5) - Fm, +m)! VO (m)- mm) = 0 (1) 


is related to the artificial regression proposed by Fisher 


iinet. 5 
web ix. — Cc). (2) 
. a ait) aad 
In the foregoing, Be is the observation vector with j = 1, 
3 a | ao there are n. observations in the first 


1 De 1 


population with parameters (m5, V) and n. obServations in 


2 


the second population. The artificial dependent variable, 
Yass takes the value no/(n, + n.) ets ase is from population 
one, and the value -n,/(n, + n,) otherwise. Wels WvSregend © 


is a constant vector with each element being the weighted 
ST ll 


Beuatlomei2yeyields artificial regression coefficients, 


grand mean of observations, l.e., ( ta no x5) /(n, a5 n,)- 


b, which are proportional to the discriminant function coef- 


ficients, yt (ee 0 of equation (1). (This may be 


1 m.. ) f 


verified by solution of the normal equations in the components 


Oe 2) )) 





Based on the above relationship between the discriminant 
and regression functions, equation (2) was utilized to 
determine a subset of variables which could be regarded as 
meg Of Principal importance in the discriminant function. 
The technique smalosed Ss discussed in Ref. 7. A multiple 
correlation coefficient, Ray was calculated from equation 
(1) using the full set of twenty-two independent variables. 
Then, each independent variable, X., Was eliminated in 
iterative fashion and a corresponding multiple correlation 


coefficient, R ,: Was calculated based on the remaining twenty- 


2. 
one independent variables. The significance of each of the 
twenty-two variables in the regression was then tested 
according to the following scheme. If the following relation- 


ship was satisfied, then the deletec variable, X., Was Gsi@anacl— 


ed as contributing little to the recression: 


q(Ry4 ~ Ro4)/(1 - Ry4) (py - Po) SF. (2a) 


In this relationship P, was the number of variables in the 
Original regression (22), P. was the number in the reduced 


set (21), F was the F variate with (py = Po) and gq degrees 


05 
of freedom, respectively; gq had the value (n, + no) 
(Dy we lle 

The above technique revealed that all but six of the 
original variables contributed little to the regression, and - 
therefore ~— to the discriminant function. The value of Ri 


was found to be 0.646 for the original variable set and had 


Pim uluecmOno26 Lor the Six Significant variables. The latter 





value was calculated from a regression based on these six 
variables aione. That is, with a seventy-three per cent 
reduction in the number of independent variables, the corre- 
lation coefficient was reduced by only eighteen per cent. 
Table II lists the atin sdepad2ileene variables in order of 
importance, the value of the correlation coefficient for the 
set of twenty-one variables remaining upon deletion of each 
of the significant variables, and the reduction in the multiple 
correlation coefficient which resulted upon deletion of the 
variable. 

With the six significant variables thus identified, a 
linear discriminant function was constructed, and its properties 
tested. The specific form of equation (1) used at this point 


was 


hole 


mead ~ =k = 0, (3) 


h 


where Lik Te gE hl Qe seavatlen vector, d 1s the vector 


Gemotsecriminant coefficients and k is a constant. The relation 
between equations (1) and (3) is given by 


dad = yt (m 


ee 


>) 


Seem) vee (m, - m.) 


With the value of all parameters calculated, equation (3) 


took the following form: 


—0.165x, + 0.015x, 4 0.411x, = 0.020x, + 0.506x, 
a 


- 0.008x, > 8.28 


i. 





Rage Limweotonificant Variables. 


Per cent Reduction 


- a Se. in Correlation 
5 Type of weather 502 2 
2 Submarine hull number 0.604 G.5 
6 Sonic layer depth 0.606 6.2 
i Geographic location 0.608 5.9 
3 Submarine hull type io aZ 5.3 
4 Submarine depth ONG ay 4.5 


An observation would ke classified as arising from population 
one if the relation was satisfied, and as arising from popula- 
tion two otherwise. With the formulation of equation (4), the 
observations were then to be classified using the equation, 
and the results achieved to be considered a measure of the 
Veeder y Of the discriminant function. 

Anderson discusses in Ref. 1 the probability of mis- 
classification associated with a linear discriminant function. 
Under the assumption that the cost of an error of misclass- 
ification is equal for each of the two possible errors, the 
pECoabality Of misclassification, P(M), is given by 


EM = P(Y > Va/2 } 


where Y iS a Standard normal variate and a takes the value of 


(my = ma) wo (m5 ieee rom the above equation, it was 


2 
found that the value of P(M) associated with equation (4) is 
O.13% 


Upon classifying the ninety-two observations of the six 


Significant variables - using equation (4) - it was found that 
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eight (30%) of the twenty-seven cases where detection occurred 
were misclassified as cases of non-detection. Similarly, 
fourteen (22%) of the sixty-five cases of non-detection were 
misclassified. Thus, the overall error of misclassification 
was twenty-five per gene. 

In view of the small sample sizes available, particularly 
in the case of population one (twenty-seven observations), the 
difference between the theoretical error of misclassification 
and the observed value was considered acceptable. The fact 
micemene Crror Of misclassification in each population 
decreased with sample size was viewed as an encouraging Sign. 

Conseguently, it was concluded that the six variables 
cited above were, in fact, of dominant importance in destroyer - 
submarine encounters. 

It was noted with interest that the character of these 
variables is such that the destroyer commander has virtually 
no means available to influence them to his advantage. There 
seems to be little direct action that could have been taken 
by the destroyer commander to increase significantly the like- 
lihood of detecting his submerged opponent. 

On the other hand, the submarine commander enjoys a distinct 
advantage. If his depth and location are chosen judiciously, 
and his submarine is of the preferred design, he may proceed 


with confidence of not being detected by his surface adversary. 


B. THE COMPUTER PROGRAM 
The computer program employed in the analysis contained 


four principal steps. First, the data were read into storage, 
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Constants were defined, and the data were transformed into a 
form to facilitate subsequent calculations. Second, the 
mebtiple correlation coefficient for the twenty-two independent 
variables was calculated. Third, using the scheme described 
above, ae silemiPieane “ercieisice WweLtemrmG@entitred. Pinally, the 
discriminant function was formulated and tested. The discus- 
Sion below is concerned with the mechanics and logic of the 
entire program. A flow chart of the program is included as 
Appendix A, and is followed by a listing of the program itself. 

The following subprograms which were prepared by the 
Seiputer Manufacturer formed a substantial part of the program: 
Seer ORDER, MINV, MULTR, DMATX. A full discussion of these 
subprograms may be found in Ref. 6. They are discussed below 
briefly where necessary to maintain continuity. 

After storage requirements and constants are defined in the 
main program, control is passed to the subroutine DATA. Here, 
a DC loop is used to read the raw data into storage as a 
three-dimensional array X(I,J,K). (The indices of X are 
observation number, variable number, and population number 
respectively.) As each card of data is read, the values of 
four variables are calculated and the data are transformed to 
a one dimensional array XD. The latter step is required by 
PQemeUPseguent use Of the manufacturer-supplied subprograms 
mention above. 

iiewirrst value calculated is that of the dependent var- 
rable Ye Vheewmmices-Oned an X({1I,23,K). Next the relative 
bearing from destroyer to submarine is converted to 


radian measure using the simple subprogram REL. With destroyer 
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course and: submarine true bearing aSinput variables, REL 
calculates the relative bearing as the output variable RB. 
This value is then converted to radian measure and stored in 
X(I,12,K). Then, relative speed is computed from destroyer 
speed, submarine speed, and the angle between the two vessels' 
courses. This value is stored in X(I,5,K). The value of 
sonic layer depth is stored in X(1I,22,K) based on the fleet 
exercise from which the observation was drawn. Finally, the 
destroyer's manning level in the sonar technician (ST) rating 
is used to calculate a parameter reflecting the rate distri- 
bution. This is accomplished by calculating a sum of weighted 
values and storing the result in X(I,13,K). For each destroyer, 
Eeemnumoer of Chief petty officers is multiplied by six, first 
class petty officers by five, and so on, until a product is 
Boermed for all skill levels in the ST rating. The sum of these 
products is then taken as the measure of manning level for that 
destroyer. 

With the data transformation completed, control returns 
to the main program and the multiple linear regression 
equations are formulated. The first subprogram called is 
CORRE. Given as input variables the number of observations 
NS, the number of variables M, and the one dimensional array 
of observations, XD - CORRE computes the vector of means 
CMeeieetne vector of Standard deviations PS, the MxM matrix of 
sums of cross-products of deviations from means D, and the 
upper triangular portion of the MxM symmetric matrix of 


Cormuciotion cCoertricients R. 


lle 





After the subscripts of the variables are stored in the 
array ISAVE in the order in which the variables are arranged 
in the array XD, control is passed to subroutine ORDER. 

Given the number of variables, the matrix of correlation 
coefficients, the pence ips number of the dependent variable, 
and the array ISAVE, subroutine ORDER computes the 22x22 matrix 
of correlation coefficients among independent variables Raye 
and the vector of correlation coefficients between the dependent 
variable and the independent variables Rio: 

Next, the subroutine MINV is called to calculate the 
inverse of Ria: This matrix plus tke vector of means, the 
vector of standard deviations and the diagonal elements of 
the matrix D are then passed to the subroutine MULTR. This 
Subprogram computes the vector of regression coefficients W 
mmemene multiple correlation coefficient ANS(2). (The vector 
Meee ANs(2)) correspond to the vector b of equation (2) and 
the variable Ray of the relation (2a) discussed earlier.) 

A DO loop is then used to eliminate each independent 
variable from the regression, and twenty-two multiple corela- 
PEIOMmCoctrVTclents for Cach combination of twenty-one variables 
is computed. The test statistic of relation (2a) is computed 
for each variable a and stored in C(J). As each new 
iemolemcerprelarton Coefficient 1S calculated, the value of 
the test statistic is stored temporarily in TEM3 for each 
test. This value is then compared to the value of the F(1,69) 


variate at the 5% significance level. If the variable with 


Pieeemipeniels tound to be of little Significance, then a zero 
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1s stored in array MK, otherwise MK(J) will contain the value 
of the subscript. 

On completion of the above, the values of the test sta- 
tistic are arranged in order of magnitude in the array C, and 
deleted from the array IGAVB. 

At this point the number of Significant variables is 
checked. This step is necessitated by the fact that the array 
TST contains values of the F(f,,f,) statistic with f = 1 and 
os f. See come Enc eStactistic as used later in the program and 
the results of the program would be invalid if the number of 
Significant variables were not within the range given for f.. 
(The range chosen was based on preliminary trials with the 
program. It was expected that there would be six Significant 
variables.) 

The subprograms ORDER, MINV and MULTR are employed as before 
with the regression equations being based on the significant 
variables only. Deletion of the indices of the insignificant 
variables from the array ISAVE results in the appropriate 
adjustments in the subprograms. The test statistic TEM3 is 
compared to the appropriate F variate TST(IT), and the index 
iommmerementea. lf the test is not satisfied, i.e., 1f the 
discarded variables have significance as a group (an unexpected 
result), a value of one is stored in ICK and this will result 
Mmecamcnange Of the flow of the program later. 


A DO loop follows the above and rearranges the order of 


the variables in the array XD in preparation for calculation 


ifs: 





mene Giscrciminant LuUNCcCtion. Control then passes to the 
Subroutine DMATX. 

Given the number of (Significant) variables L, the number 
of populations LX, the number of observations in each popula- 
tion N, and the array of data XD - DMATX calculates the vector 
of means XBAR and the common covariance matrix D. The inverse 
of Dis then calculated by subroutine MINV and control passes 
to subroutine DISC. This latter subroutine is a simple 
FORTRAN IV equivalent of equation (1). 

Next, each observation is classified using the discriminant 
function just formulated. The value of the function is tested 
against zero and the observation classified accordingly. For 
each population, the value of KOUNT is incremented each time 
an observation is improperly classified. 

After all observations have been classified, the value of 
ICK is checked. As discussed above, if ICK has value one, the 
previously discarded variables have group significance. If 
mits 1S the case, a warning to that effect is printed, the 
index of the most significant deleted variable is added to the 
array MK, control returns to statement number 45, the new 
group of significant variables are tested and a new discriminant 
minecrmlon 1s found. If ICK has a value of zero, the discrim- 
inant function coefficients are printed and the program 


Eonminates . 
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III. CONCLUSICN 


Analysis of data collected during fleet exercises conducted 
during the period 1962 to 1966 inclusive strongly suggests 
that the variables listed in Table II had a dominant influence 
on destroyer - submarine encounters. Specifically, the 
successful detection of a submarine by a destroyer utilizing 
a sonar of the type specified in the active mode was influenced 
more strongly by these six variables than by the remaining 
Sixteen considered. 

The discriminant function, equation (4), was 75% effective 
in classifying encounters and shoulc be useful as a predictive 
tool in assessing the likelihood of detection under known 
environmental conditions and against. known submarine types. 

Consideration of the variables fiound to be Significant 


indicates that the tactical advantage was with the submarine 


commander during encounters of the type considered. 


ee, 





FLOW CHART OF COMPUTER PROGRAM 


START 


Der OEN I 


STORAGE 


READ LX, 
M, N 


ICK=0 


KI=M-1 


NS=N (1) 
+ N(s) 





DEFINE 
ARRAY 
lipcyd b 


CALL DATA 





LSAVE (1) 





2 





CALL ORDER 


CALL MINV 


CALL MULTR 


WRITE W / 
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DN = 
CET EMI) 7/69 


KD=KI-1L1 





2H 








JS 01 





ISAVE (L) 
= JI 


CALL ORDER 


CALL MINV 
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©) 


CALL MULTR 


PRINT MULT. CORRELATION 
CCEE ETC FOR JTH 
Gr OUrnOme |) VARIABLES 


TEM2 


=ANS (2) **2 










TEM3= 
(TEM1-TEM2 
/ DN 


eS 


C(J)=TEM3 
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< 
J=J+1 |. J:KI >< 
a 
ila 


() SR Ga ) 
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KL=KL+1 





> | STR=C (KL) 





IND=KL 


eA | 
| 


IDX (KJ) =INDIe—— ~ Sica 






KJ=KJ+1 





JJ=1 


MK (JJ) :0 


L=L+1 


ISAVE (L) 


=MK (JJ) 





60 


do -K1 : JJ=JI+1 


(0 ee 
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Vet loAVEs( LNYDECES OF 
SPGNTPEWeANT VARDABLES} 


aii 





TEM2 = 
ANS (2) **2 


TEM3= (TEM1L- 
TEM2) / 
(DN* (KI-L) ) 





JERSE die ae 


EM3: 
JESU alae 
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TCK 


— 
— 


i 





NKEN (K) 


N2=LK-NK 





ISAVE (IE) 
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N2=N2+NK 











XD(N2) = 


Retin yk) 


fE-E 


JK=JK+1 





K=K+1 





CALL DMATX 








WRITE XBAR(II), XBAR(JxX) 
(VECTORS OF MEANS) 


CALL MINV 
\ 
CAL DiSe 








KountT=0 [* ia) 





a 


=] 


re 
il 





vy 
IND=0 


JL=1 
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IG = 
SAY Es (oily) 
IND=IND+1 


SUM=SUM+ 
Cre (11, 


IG,KN) 


KD < queef 


oe 


2 
> 








WRITE 
KOUNT , KN 
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Ox LrOry-* 


KOUNT= 
KOUNT+1 





KN=KN+1 


LK=L+1 








WRETE OC, VECTOR OF 
DISCRIMINANT FUNCTION 
COEFFICIENTS 


ND=IDX (L+1 
MK (IND)=IND 
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COMPUTER PROGRAM 
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